PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen06g020980.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 298aa    MW: 33813 Da    PI: 7.8915
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen06g020980.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox59.84.5e-19128182256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ ++tkeq  +Le+ F+ +++++ +++  LAk+lgL  rqV vWFqNrRa+ k
  Sopen06g020980.1 128 RKKLRLTKEQSAVLEDSFKDHHTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTK 182
                       678899***********************************************98 PP

2HD-ZIP_I/II126.89.5e-41128217191
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                       +kk+rl+keq+++LE+sF+ +++L+p++K +la++Lgl+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l++en+rL+kev+eLr +l
  Sopen06g020980.1 128 RKKLRLTKEQSAVLEDSFKDHHTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCVENLTDENRRLQKEVQELR-SL 217
                       69*************************************************************************************9.55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046181.0E-292101IPR006712HD-ZIP protein, N-terminal
SuperFamilySSF466891.0E-18119185IPR009057Homeodomain-like
PROSITE profilePS5007117.022124184IPR001356Homeobox domain
SMARTSM003891.1E-17126188IPR001356Homeobox domain
CDDcd000862.00E-16128185No hitNo description
PfamPF000462.0E-16128182IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.603.0E-18128180IPR009057Homeodomain-like
PRINTSPR000313.0E-5155164IPR000047Helix-turn-helix motif
PROSITE patternPS000270159182IPR017970Homeobox, conserved site
PRINTSPR000313.0E-5164180IPR000047Helix-turn-helix motif
CDDcd146860.00456177216No hitNo description
Gene3DG3DSA:1.20.5.1708.2E-4181217No hitNo description
SMARTSM003402.3E-25184227IPR003106Leucine zipper, homeobox-associated
PfamPF021834.3E-10184218IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010582Biological Processfloral meristem determinacy
GO:0048467Biological Processgynoecium development
GO:0080127Biological Processfruit septum development
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
GO:0043621Molecular Functionprotein self-association
Sequence ? help Back to Top
Protein Sequence    Length: 298 aa     Download sequence    Send to blast
MMMGKEDLGL SLSLNFPAEK TTTTINLISP PPSSSFNDNY WTTHPPFPHS SSDRNMETRS  60
FLKGIDVNRM PAMAAEEEEG GVSSPNSTIS SLSGNKRSER EGNCTEENEM ERASSRGISD  120
EEDGETCRKK LRLTKEQSAV LEDSFKDHHT LNPKQKLALA KRLGLRPRQV EVWFQNRRAR  180
TKLKQTEVDC EFLKRCVENL TDENRRLQKE VQELRSLKHS PQFYMQMTPP TTLTMCPSCE  240
RVATGPTNTP VNIPPHRVGP PHQHHQPMPL NMWDPSSTPI SQGHYGQVDT YPSLARQK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1176184RRARTKLKQ
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00050PBMTransfer from AT4G17460Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754450.0HG975445.1 Solanum pennellii chromosome ch06, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015077450.10.0PREDICTED: homeobox-leucine zipper protein HAT4
SwissprotP466002e-97HAT1_ARATH; Homeobox-leucine zipper protein HAT1
TrEMBLK4C6R30.0K4C6R3_SOLLC; Uncharacterized protein
STRINGSolyc06g060830.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA11182485
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G17460.12e-95Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein